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NOVEL MODIFIED MSP-1 NUCLEIC ACID SEQUENCES 
AND METHODS FOR INCREASING mRNA LEVELS AND PROTEIN 
EXPRESSION IN CELL SYSTEMS 



BACKGROUND OF THE INVENTION 

10 Field of the inversion 

The invention relates to heterologous gene expression. More particularly, the 
invention relates to the expression of malaria genes in higher eukaryote cell 
systems. 



15 Summary of the related art 

Recombinant production of certain heterologous gene products is often 
difficult in in vitro cell culture systems or in vivo recombinant production systems. 
For example, many researchers have found it difficult to express proteins derived 
from bacteria, parasites and virus in cell culture systems different from the ceil from 

20 which the protein was originally derived, and particularly in mammalian cell 
culture systems. One example of a therapeutically important protein which has 
been difficult to produce by mammalian cells is the malaria merozoite surface 
protein (MSP-1). 

25 Malaria is a serious heath problem in tropical countries. Resistance to 

existing drugs is fast developing and a vaccine is urgently needed. Of the number of 
antigens that get expressed during the life cycle of R falciparum, MSP-1 is the most 
extensively studied and promises to be the most successful candidate for 
vaccination. Individuals exposed to P. falciparum develop antibodies against 

30 MSP-1, and studies have shown that there is a correlation between a naturally 

acquired immune response to MSP-1 and reduced malaria morbidity. In a number 
of studies, immunization with purified native MSP-1 or recombinant fragments of 
the protein has induced at least partial protection from the parasite (Diggs et al, 
(1993) Parasitol. Today 9:300-302). Thus MSP-1 is an important target for the 



WO 99/20774 

development of a vaccine against P. falciparum. 
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MSP-1 is a 190-220 kDA glycoprotein. The C-terminal region has been the 
focus of recombinant production for use as a vaccine. However, a major problem in 
5 developing MSP-1 as a vaccine is the difficulty in obtaining recombinant proteins in 
bacterial or yeast expression systems that are equivalent in immunological potency 
to the affinity purified native protein (Chang et al., (1992) /. Immunol. 148:548-555.) 
and in large enough quantities to make vaccine production feasible. 

10 Improved procedures for enhancing expression of sufficient quantities of 

MSP-1 would be advantageous. 



2 
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BRIEF SUMMARY QF THE INVENTION 

The present inventioa provides improved recombiaant DNA compositions 
5 and procedures for increasing the mRNA levels and protein expression of the 
malarial surface antigen MSP-1 in cell culture systems, mammalian cell culture 
systems, or in transgenic mammals. The preferred protein candidate for expression 
in an expression system in accordance with the invention is a C-terminal derivative 
of MSP-1 having a DNA coding sequence with reduced AT content, and eliminated 
10 mRNA instability motifs and rare codons relative to the recombinant expression 
systems. Thus, in a first aspect, the invention provides a DNA sequence derived 
from the sequence shown in SEQ ID NO 2. This derivative sequence is shown in 
SEQ ID NO 1. 

In a second aspect, the invention provides a process for preparing a modified 
15 nucleic acid of the invention comprising the steps of lowering the overall AT 
content of the natural gene encoding MSP-1, eliminating all mRNA instability 
motifs and replacing all rare codons with a preferred codon of the mammary gland 
tissue, all by replacing specific codons in the natural gene with codons recognizable 
to, and preferably preferred by mammary gland tissue and which code for the same 
20 amino acids as the replaced codon. This aspect of the invention further includes 
modified nucleic acids prepared according to the process of the invention. 

In a third aspect, the invention also provides vectors comprising modified 
MSP-1 nucleic acids of the invention and a goat beta casein promoter and signal 
sequence, and host cells transformed with nucleic acids of the invention. 
25 In a fourth aspect, the invention provides transgenic non-human mammals 

whose germlines comprise a nucleic acid of the invention. 

In a fifth aspect, the invention provides a DNA vaccine comprising a 
modified MSP-1 gene according to the invention. 
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DESCRIPTION Of THE DRAWINGS 

Fig. 1 depicts the cDNA sequence of MSP-1 42 modified in accordance with the 
invention [SEQ ID NO 1] in which 306 nucleotide positions have been replaced to 
5 lower AT content and eliminate mRNA instability motifs while maintaining the 
same protein amino acid sequence of MSP-1 42 . The large letters indicate nucleotide 
substitutions. 

Fig. 2 depicts the nucleotide sequence coding sequence of the "wild type" or native 
10 MSP- 1 42 [SEQ ID NO 2]. 

Fig 3 is a codon usage table for wild type MSP-1 42 (designated "MSP wt" in the table) 
and the new modified MSP-1 42 gene (designated "edited MSP" in the table) and 
several milk protein genes (casein genes derived from goats and mouse). The 
15 numbers in each column indicate the actual number of times a specific codon 
appears in each of the listed genes. The new MSP-1 42 synthetic gene was derived 
from the mammary specific codon usage by first choosing GC rich codons for a given 
amino acid combined with selecting the amino acids used most frequently in the 
milk proteins. 

20 

Fig. 4a-c depict MSP-1 42 constructs GTC 479, GTC 564, and GTC 627, respectively as 
are described in the examples. 

Fig. 5 panel A is a Northern analysis wherein construct GTC627 comprises the new 
25 MSP-1 42 gene modified in accordance with the invention, GTC479 is the construct 
comprising the native MSP-1 42 gene, and construct GTC469 is a negative control 
DNA 

Fig 5 panel B is a Western analysis wherein the eluted fractions after affinity 
purifications numbers are collected fractions. The results show that fractions from 
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GTC679 the modified MSP-1 42 synthetic gene construct reacted with polyclonal 
antibodies to MSP-1 and the negative control GTC479 did not. 



Fig 6 depicts the nucleic acid sequences of OT1 [SEQ ID NO 3], OT2 [SEQ ID NO 4], 
5 MSP-8 [SEQ ID ON 5], MSP-2 [SEQ ID NO 6] and MSP1 [SEQ ID NO 7] described in the 
Examples. 

Fig 7 is a schematic representation of plasmid BC574. 
10 Fig 8 is a schematic representation of BC620. 
Fig 9 is a schematic representation of BC670. 

Fig 10 is a representation of a Western blot of MSP transgenic milk. 

15 

Fig 11 is a schematic representation of the nucleotide sequence of MSP42-2 [SEQ ID 
NO 8]. 

Fig 12 is a schematic representation of the BC-718. 

20 

Fig 13 is a representation of a Western blot of BC-718 expression in transgenic milk. 
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DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS 



The patent and scientific literature referred to herein establishes the 
knowledge that is available to those with skill in the art. The issued US patents, 
5 allowed applications, published foreign applications, and references cited herein are 
hereby incorporated by reference. Any conflicts between these references and the 
present disclosure shall be resolved in favor of the present disclosure. 

The present invention provides improved recombinant DNA compositions 
10 and procedures for increasing the mRNA levels and protein expression of the 
malarial surface antigen MSP-1 in cell culture systems, mammalian cell culture 
systems, or in transgenic mammals. The preferred protein candidate for expression 
in an expression system in accordance with the invention is a C-terminal derivative 
of MSP-1 having a DNA coding sequence with reduced AT content, and eliminated 
15 mRNA instability motifs and rare codons relative to the recombinant expression 
systems. Thus, in a first aspect, the invention provides a DNA sequence derived 
from the sequence shown in SEQ ID NO 2. This derivative sequence is shown in 
SEQ IDNOl. 

In preferred embodiments, the nucleic acid of the invention is capable of 
20 expressing MSP-1 in mammalian cell culture systems, or in transgenic mammals at 
a level which is at least 25%, and preferably 50% and even more preferably at least 
100% or more of that expressed by the natural gene in mammalian cell culture 
systems, or in transgenic mammals under identical conditions. 

As used herein, the term "expression" is meant mRNA transcription 
25 resulting in protein expression. Expression may be measured by a number of 

techniques known in the art including using an antibody specific for the protein of 
interest. By "natural gene" or "native gene" is meant the gene sequence, or 
fragments thereof (including naturally occurring allelic variations), which encode 
the wild type form of MSP-1 and from which the modified nucleic acid is derived. 
30 A "preferred codon "means a codon which is used more prevalently by the cell or 



WO 99/20774 PCT/US98/22226 
tissue type in which the modified MSP-1 gene is to be expressed, for example, in 
mammary tissue. Not all codon changes described herein are changes to a preferred 
codon, so long as the codon replacement is a codon which is at least recognized by 
the mouse mammary tissue. The term "reduced AT content" as used herein means 
5 having a lower overall percentage of nucleotides having A (adenine) or T (thymine) 
bases relative to the natural MSP-1 gene due to replacement of the A or T containing 
nucleotide positions or A and/or T containing codons with nucleotides or codons 
recognized by mouse mammary tissue and which do not change the amino acid 
sequence of the target protein. 

10 

In a second aspect, the invention provides a process for preparing a modified 
nucleic acid of the invention comprising the steps of lowering the overall AT 
content of the natural gene encoding MSP-1, eliminating all mRNA instability 
motifs and replacing all rare codons with a preferred codon of mammary gland 

15 tissue, all by replacing specific codons in the natural gene with codons recognizable 
to, and prferably preferred by mammary gland tissue and which code for the same 
amino acids as the replaced codon. Standard reference works describing the general 
principals of recombinant DNA technology include Watson, J.D. et al, Molecular 
Biology of the Gene, Volumes I and II the Benjamin/Cummings Publishing 

20 Company, Inc. publisher, Menlo Park, CA (1987) Darnell, J.E. et al., Molecular Cell 
Biology, Scientific American Books, Inc., Publisher, New York, NY (1986); Old, R.W., 
et al., Principles of Gene Manipulation: An Introduction to Genetic Engineering, 2d 
edition, University of California Press, publisher, Berkeley CA (1981); Maniatis, T., 
et al., Molecular Cloning: A Laboratory Manual, 2 nd ed. Cold Spring Harbor 

25 Laboratory, publisher, Cold Spring Harbor, NY (1989) and Current Protocols in 

Molecular Biologxf, Ausubel et al., Wiley Press, New York, NY (1992). This aspect of 
the invention further includes modified nucleic acids prepared according to the 
process of the invention. 

Without being limited to any theory, previous research has indicated that a 
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conserved AU sequence (AUUUA) from the 3' untranslated region of GM-CSF 
mRNA mediates selective mRNA degradation (Shaw, G. and Kamen, R. Cell 
46:659-667). The focus in the past has been on the presence of these instability motifs 
in the untranslated region of a gene. The instant invention is the first to recognize 
5 an advantage to eliminating the instability sequences in the coding region of the 
MSP-1 gene. 



In a third aspect, the invention also provides vectors comprising modified 
MSP-1 nucleic acids of the invention and a goat beta casein promoter and signal 
10 sequence, and host cells transformed with nucleic acids of the invention. 

In a fourth aspect, the invention provides transgenic non-human mammals 
whose germlines comprise a nucleic acid of the invention. General principals for 
producing transgenic animals are known in the art. See for example Hogan et aL, 

15 Manipulating the Mouse Embryo: A Laboratory Manual, Cold Spring Harbor 

Laboratory, (1986); Simons et al, Bio/Technology 6:179-183, (1988); Wall et aL Biol 
Reprod. 32:645-651, (1985); Buhler et aL, Bio/Technology, 8:140-143 (1990); Ebert et aL, 
Bio/Technology 9:835-838 (1991); Krimenfort et al., Bio/Technology 9:844-847 (1991); 
Wall et aL, J. Cell Biochem. 49:113-120 (1992). Techniques for introducing foreign 

20 DNA sequences into mammals and their germ cells were originally developed in 
the mouse. See e.g., Gordon et aL, Proc. Natl Acad. Sci. USA 77:7380-7384, (1980); 
Gordon and Ruddle, Science 214: 1244-1246 (1981); Palmiter and Brinster, Cell 41: 
343-345, 1985; Brinster et aL, Proc Natl Acad Sci., USA 82:4438-4442 (1985) and Hogan 
et aL (ibid.). These techniques were subsequently adapted for use with larger 

25 animals including cows and goats. Up until very recently, the most widely used 

procedure for the generation of transgenic mice or livestock, several hundred linear 
molecules of the DNA of interest in the form of a transgenic expression construct 
are injected into one of the pro- nuclei of a fertilized egg. Injection of DNA into the 
cytoplasm of a zygote is also widely used. Most recently cloning of an entire 
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transgenic cell line capable of injection into an unfertilized egg has been achieved 
(KHS Campbell et al., Nature 380 64-66, (1996)). 

The mammary gland expression system has the advantages of high 
expression levels, low cost, correct processing and accessibility. Known proteins, 
5 such as bovine and human alpha- lactalbumin have been produced in lactating 
transgenic animals by several researchers. (Wright et al, Bio/Technology 9:830-834 
(1991); Vilotte et al, Eur. f. Biochem., 186:43 -48 (1989); Hochi et al., Mol Reprod. And 
Devel 33:160-164 (1992); Soulier et al., FEES Utters 297(1,2):13-18 (1992)) and the 
system has been shown to produce high levels of protein. 

10 

In a fifth aspect, the invention provides a DNA vaccine comprising a 
modified MSP-1 gene according to the invention. Such DNA vaccines may be 
delivered without encapsulation, or they may be delivered as part of a liposome, or 
as part of a viral genome. Generally, such vaccines are delivered in an amount 
15 sufficient to allow expression of the modified MSP-1 gene and to elicit an antibody 
response in an animal, including a human, which receives the DNA vaccine. 
Subsequent deliveries, at least one week after the first delivery, may be used to 
enhance the antibody response. Preferred delivery routes include introduction via 
mucosal membranes, as well as parenteral administration. 

20 

Examples 

Creation of novel modified MSP-1 42 gene 

25 A novel modified nucleic acid encoding the C-terminal fragment of MSP-1 is 

provided. The novel, modified nucleic acid of the invention encoding a 42 kD 
C-terminal part of MSP-1 (MSP-1 42 ) capable of expression in mammalian cells of 
the invention is shown in Fig. 1. The natural MSP-1 42 gene (Fig 2) was not capable 
of being expressed in mammalian cell culture or in transgenic mice Analysis of the 

30 natural MSP- 1 42 gene suggested several characteristics that distinguish it from 
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mammalian genes. First, it has a very high overall AT content of 76%. Second, the 
mRNA instability motif, AUUUA, occurred 10 times in this 1100 bp DNA segment 
(Fig 2). To address these differences a new MSP-1 42 gene was designed. Silent 
nucleotide substitution was introduced into the native MSP-1 42 gene at 306 
5 positions to reduce the overall AT content to 49.7%. Each of the 10 AUUUA mRNA 
instability motifs in the natural gene were eliminated by changes in codon usage as 
well. To change the codon usage, a mammary tissue specific codon usage table, Fig. 
3a, was created by using several mouse and goat mammary specific proteins. The 
table was used to guide the choice of codon usage for the modified MSP-1 42 gene as 
10 described above. For example as shown in the Table in Fig. 3a, in the natural gene, 
65% (25/38) of the Leu was encoded by TTA, a rare codon in the mammary gland. In 
the modified MSP-1 42 gene, 100% of the Leu was encoded by CTG, a preferred codon 
for Leu in the mammary gland. 

15 An expression vector was created using the modified MSP-1 42 gene by fusing the 
first 26 amino acids of goat beta-casein to the N-terminal of the modified MSP-1 42 
gene and a Sall-Xho I fragment which carries the fusion gene was subcloned into the 
Xhol site of the expression vector pCDNA3. A His6 tag was fused to the 3' end. of the 
MSP-1 42 gene to allow the gene product to be affinity purified. This resulted in 

20 plasmid GTC627 (Fig.4c). 

To compare the natural MSP-1 42 gene construct to the modified MSP-1 42 nucleic 
acid of the invention, an expression vector was also created for the natural MSP-1 42 
gene and the gene was added to mammalian cell culture and injected into mice to 
25 form transgenic mice as follows: 

Construction of the native MSP-1 42 Expression Vector 
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To secrete the truncated merozoite surface protein- 1 (MSP-1) of Plasmodium 
falciparum, the wild type gene encoding the 42KD C-terminal part of MSP-1 
(MSP-1 42 ) was fused to either the DNA sequence that encodes the first 15 or the first 
26 amino acids of the goat beta-casein. This is achieved by first PCR amplify the 
5 MSP-1 plasmid (received from Dr. David Kaslow, NIH) with primers MSP1 and 
MSP2 (Fig. 6), then cloned the PCR product into the TA vector (Invitrogen). The 
Bglll-Xhol fragments of the PCR product was ligated with oligos OT1 and OT2 (Fig, 
6) into the expression vector pCDNA3. This yielded plasmid GTC564 (Fig.4b), which 
encodes the 15 amino acid beta- casein signal peptide and the first 11 amino acids of 

10 the mature goat beta-casein followed by the native MSP-1 42 gene. Oligos MSP-8 and 
MSP -2 (Fig. 6) were used to amplify MSP-1 plasmid by PCR, the product was then 
cloned into TA vector. The Xhol fragment was exercised and cloned into the Xhol 
site of the expression vector pCDNA3 to yield plasmid GTC479 (Fig.4a), which 
encoded 15 amino acid goat beta-casein signal peptide fused to the wild-type MSP-1 42 

15 gene. A His6 tag was added to the 3' end of MSP-1 42 gene in GTC 564 and GTC 479. 

Native MSP-1 42 Gene Is Not Expressed In COS-7 Cells 

Expression of the native MSP gene in cultured COS-7 cells was assayed by transient 
20 transfection assays. GTC479 and GTC564 plasmids DNA were introduced into COS-7 
cells by lipofectamine (Gibco-BRL) according to manufacturer's protocols. Total 
cellular RNA was isolated from the COS cells two days post-transfection. The newly 
synthesized proteins were metabolically labeled for 10 hours by adding 35 S 
methionine added to the culture media two days-post transfection. 

25 

To determine the MSP mRNA expression in the COS ceils, a Northern blot was 
probed with a 32 P labeled DNA fragment from GTC479. No MSP RNA was detected 
in GTC479 or GTC564 transfectants (data not shown). Prolonged exposure revealed 
residual levels of degraded MSP mRNA. The 3? S labeled culture supernatants and 

11 
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the lysates were immunoprecipitated with a polyclonal antibody raised against MSP. 
Immunoprecipitation experiments showed that no expression from either the 
lysates or the supernatants of the GTC479 or GTC564 transfected cells (data not 
shown). These results showed that the native MSP-1 gene was not expressed in COS 
5 cells. 



Native MSP-l 4 ' 2 Cene is Not Expressed in the Mammary Gland of Transgenic Mice 

10 

The Sall-Xhol fragment of GTC479, which encoded the 15 amino acids of goat beta- 
casein signal peptide, the first 11 amino acids of goat beta-casein, and the native 
MSP-I42 gene, was cloned into the Xhol site of the beta-casein expressed in vector 
BC350. This yielded plasmid BC574 (Fig.7). A Sail-NotI fragment of BC574 was 

15 injected into the mouse embryo to generate transgenic mice. Fifteen lines of 

transgenic mice were established. Milk from the female founder mice was collected 
and subjected to Western analysis with polycolonal antibodies against MSP. None 
of the seven mice analyzed were found to express MSP-1 42 protein in their milk. To 
further determine if the mRNA of MSP-1 42 was expressed in the mammary gland, 

20 total RNA was extracted from day 11 lactating transgenic mice and analyzed by 
Northern blotting. No MSP-1 42 mRNA was detected by any of the BC 574 lines 
analyzed. Therefore, the MSP-1 42 transgene was not expressed in the mammary 
gland of transgenic mice. Taken together, these experiments suggest that native 
parasitic MSP-1 42 gene could not be expressed in mammalian cells, and the block is 

25 as the level of mRNA abundance. 

Expression of MSP in the Mammalian Cells 

Transient transfection experiments were performed to evaluate the expression of 
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the modified MSP-1 42 gene of the invention in COS cells. GTC627 and GTC479 
DNA were introduced into the COS-7 cells. Total RNA was isolated 48 hours 
post-transfection for Northern analysis. The immobilized RNA was probed with 
32 P labeled Sall-Xhol fragment of GTC627. A dramatic difference was observed 
5 between GTC479 and GTC627. While no MSP-1 42 mRNA was detected in the 
GTC479 transfected cells as shown previously, abundant MSP-1 42 mRNA was 
expressed by GTC627 (Fig. 5, Panel A). GTC 469 was used as a negative control and 
comprises the insert of GTC564 cloned into cloning vector PU19, a commercially 
available cloning vector. A metabolic labeling experiment with 35 S methionine 

10 followed by immunoprecipitation with polyclonal antibody (provided by D. Kaslow 
NIAID, NIH) against MSP showed that MSP-1 42 protein was synthesized by the 
transfected COS ceils (Fig.5, Panel B), Furthermore, MSP-1 42 was detected in the 
transfected COS supernatant, indicating the MSP-1 42 protein was also secreted. 
Additionally, using Ni-NTA column, MSP-1 42 was affinity purified from the 

15 GTC627 transfected COS supernatant. 

These results demonstrated that the modification of the parasitic MSP-1 42 gene lead 
to the expression of MSP mRNA in the COS cells. Consequently, the MSP-1 42 
product was synthesized and secreted by mammalian cells. 

20 

Polyclonal antibodies used in this experiment may also be prepared by means well 
known in the art (Antibodies: A Laboratory Manual, Ed Harlow and David Lane, 
eds. Cold Spring Harbor Laboratory, publishers (1988)). Production of MSP serum 
antibodies is also described in Chang et aL, Infection and Immunity (1996) 64:253-261 
25 and Chang et al., (1992) Proc Natl. Acad. Sci. USA 86:6343-6347. 

The results of this analysis indicate that the modified MSP-1 42 nucleic acid of the 
invention is expressed at a very high level compared to that of the natural protein 

13 



WO 99/20774 PCT7US98/22226 

which was not expressed at all. These results represent the first experimental 
evidence that reducing the AT % in a gene leads to expression of the MSP gene in 
heterologous systems and also the first evidence that removal of AUUUA mRNA 
instability motifs from the MSP coding region leads to the expression of MSP 
5 protein in COS cells. The results shown in Fig. 5, Panel A Northern (i.e. no RNA 
with native gene and reasonable levels with a modified DNA sequence in 
accordance with the invention), likely explains the increase in protein production. 

The following examples describe the expression of MSP1-42 as a native 
10 non-fusion (and non-glycosylated) protein in the milk of transgenic mice. 

Construction of MSP Trans gene 

To fuse MSP1-42 lo the 15 amino acid li-casein signal peptide, a pair of oligos, MSP203 

15 and MSP204 (MSP203: ggccgctcgacgccacxatgaaggLcctcataatlgcc 

tgtciggtggctciggccattgcagccgtcactccciccgtcai, MSP204: cgalgacggagggaglgacggcig 
caatggccagagccaccagacaggcaaltatgaggaccttcalgglggcgLcgagc), which encode the 15 amino acid - 
casein signal and the first 5 amino acid of the MSP I -42 ending at the Cla I site, was ligated with a 
Cla I-Xho I fragment of BC620 (Fig. 8) which encodes the rest ol the MSP 1-42 gene, into the 

20 Xho I site of the expression vector pCDNA3. A Xho I fragment of this plasmid (GTC669) was 
then cloned into the Xho I site of milk specific expression vector BC350 to generate B670 (Fig.9) 

Expression of MSP 1-42 in the milk of transgenic mice 

25 A Sal I-Not I fragment was prepared from plasmid BG670 and microinjected into the 

mouse embryo to generate transgenic mice. Transgenic mice was identified by extracting mouse 
DNA from tail biopsy followed by PCR analysis using oligos GTC17 and MSP101 (sequences of 
oligos: GTC17, GATTGACAAGTAATACGCTGTTTCCTC, Oligo MSP10L 
GGATTCAATAGATACGG). Milk from the female founder transgenic mice was collected at day 

30 7 and day 9 of lactation, and subjected to western analysis to determine the expression level of 
MSP- 1 -42 using an polyclonal anti-MSP antibody and monoclonal anti MSP antibody 5.2 (Dr. 
David Kaslow, NIH). Results indicated thai the level of MSP- 1-42 expression in the milk of 

14 
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transgenic mice was at 1-2 mg/ml (Fig. 10). 
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Construction of MSP I -42 glycosylation sites minus mutants 

5 Our analysis of the milk produced MSP revealed that the transgenic MSP protein was 

N-glycosylated. To eliminate the N-glycosylation sites in die MSP 1-42 gene, Asn. (N) at 
positions 181 and 262 were substituted with Gln.(Q). The substitutions were introduced by 
designing DNA oligos that anneal to the corresponding region of MSP1 and carry the AAC to 
CAG mutations. These oligos were then used as PCR primers to produce DNA fragments thai 
10 encode the N to Q substitutions. 

To introduce N262-Q mutation, a pair of oligos, MSPGYLYCO-3 
(C AGGG AATGCTGC AG ATC AGC) AND MSP42-2 (AATTCTCGAGTTAGTG 
GTGGTGGTGGTGGTGATCGCAGAAAATACCATG, FIG. 1 1), were used to PCR amplify 
15 plasmid GTC627, which contains the synthetic MSPI -42 gene. The PCR product was cloned into 
pCR2. 1 vector (Invitrogcn). This generated plasmid GTC7 16. 

To introduce N181-Q mutation, oligos MSPGLYCO-I (CTCCTTGTTCAGG 
AACTTGTAGGG ) and MSPGLCO-2 (GTCCTGCAGTACACATATGAG, Fig 4) were used to 
20 amplify plasmid GTC 627. The PCR product was cloned into pCR2. 1. Thus generated plasmid 
GTC700. 



The MSP double glycosylation mutant was constructed by the Following three steps: first, 
a Xho I-Bsm I fragment of BC670 and the Bsm 1-Xho I fragment of GTC716 is ligated into the 
25 Xho I site of vector pCR2. 1 . This resulted a plasmid that contain the MSP- 1 -42 gene with 

N262-Q mutation. EcoN I-Nde I fragment of this plasmid was then replaced by the EcoN I-Nde I 
fragment from plasmid GTC7 16 to introduce the second mutation, N18 l-Q. A Xho I fragment of 
this plasmid was finally cloned into BC350 to generate BC7 18 (Fig. 12). 

30 Transgenic expression of nonglycosylated MSP- 1 

BC7 18 has the following characteristics: it carries the MSPI -42 gene under the control of 
the B-casein promoter so it can be expressed in the mammary gland of the transgenic animal during 
lactation. Further, it encodes a 15 amino acid B-cascin leader sequence fused directly to MSP I -42. 
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so that the MS PI -42, without any additional amino acid at its N-terminal, can be secreted into the 
milk. Finally, because the N-Q substitutions, the MSP produced in the milk of the transgenic 
animal by this construct will not be N-glycosylatcd. Taken together, the transgenic MSP produced 
in the milk by BC7 1 8 is the same as the parasitic MSP. 

A Sall/Xhol fragment was prepared from plasmid BC7 18 and microinjected into mouse 
embryos to generate transgenic mice. Transgenic animals were identified as described previously. 
Milk from female founders was collected and analyzed by Western blotting with antibody 5.2. The 
results, shown in Figure 13, indicate expression of nonglycosylatcd MSP I at a concentration of 
0.5 to 1 mg/ml. 
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What is claimed is: 

1. A modified MSP-1 nucleic acid sequence, as shown in SEQ ID NO 1. 



5 2. A process for preparing a modified MSP-1 nucleic acid comprising the steps of 
lowering the overall AT content of the natural gene encoding MSP-1, eliminating 
all mRNA instability motifs and replacing all rare codons with a preferred codon of 
the mouse mammary gland. 

10 3. A modified MSP-1 nucleic acid prepared by the process according to claim 2. 

4. A vector comprising a modified MSP-1 nucleic acid according to claim 1 and a 
goat beta casein promoter and signal sequence. 

15 5. A host cell transformed with the vector according to claim 4. 

6. A transgenic non-human mammal comprising a vector according to claim 4. 

7. A transgenic non-human mammal comprising a modified MSP-1 nucleic 
20 acid according to claim 1. 

8 . A DN A vaccine comprising a modified nucleic acid according to claim 1 . 



25 
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i ocGgtCXctccOtccgtOv jGATAACATGcrQicOAG^TGc^G^CbAGr^ 
1>AI aVa I ThrProSerVa I II eAspAsnt I eLeuSer Ly s M eGl uAsnGl uTyr G 

56 AGGrKtiOmCETQ^GacQuaGs^ 

19> luVal LouTyr LeuLysPro LeuAl aGi yVal Tyr Ar gSer LeuLysLysGl n 

109 QrG3&(^C^CGlCkTX&^ 
21> Leu Gl uAsnAsnVa I MetThr PheAsnVal AsnVal LysAspl I eLeuAsnSer 

163 ogGitC^C^GcxsGga^ 

55^A rgPheAsnLysA rgGl uAsnPheLy sAsn Va I LeuGI uSe r AspLeu 1 1 ePr 

216 CmC&AG^TG<^ 

72* oTyr LysAspLeuThr Ser So r AsnTyr Val ValLysAspProTyr Ly s PheL 

269 T<^C^G^G^GuGAGATAA^ 
90> euAsnLysGI uLy s A rgAspLys Phe Leu Ser Ser Tyr AsnTyr 1 1 eLysAspSe 

324 C^TbATAcQbATATC^ChT^ 

108> r 1 1 eAspThrAspl I eAsnPheAt aAsnAspVal LeuGI yTyr Tyr Lys 1 1 eLe 

378 QrccGiiG^Qj^(^ 

12S> uSerGI uLysTyr Lys Se r AspLeuAspSe r 1 1 eLysLysTyr I leAsnAspLy 
432 GcAGbGA^GAACbAGAA 

144> sGI nGI yGI uAsnGl uLy sTyrLeuProPhe LeuAsnAsn 1 1 eGl uThr LeuTy 

486 OaQc GgtC&aGgataaGattc^t^^ OaGgt 

162> r LysThr Va I AsnAspLys I I eAspLeuPheVa I I I eHi s LeuGI uAI a Lys Va 
Ndel 

540 QrrG^GrACACATATGAGAAG?^QC! aacgtGgaGgtCaaGatCaaGgaGctGaa 

180> I LeuAsnTyr Thr TyrGI uLy s Se r AsnVa I Gl u Va I Ly s 1 1 e Ly s Gl u Leu As 

594 ttac GiOaG^c CatGcaGgaTaaQI tggc GgatttCaaGaaG^C^acaaOtt 

198> nTyrLeuLy sThr I I eGl nAs pLys LeuAl aAspPheLy s Ly sAs nAsnAs nPh 
648 CGT03GGATQx:G3ATGl<5OrAcGc^ 

216> eVa I Gl y II eAI aAspLeu Se r Th r AspTyr AsnHI sAsnAsnLeu LeuThr Ly 
702 GTICCTG^O^cCbGTATGCTOlTQ 

234>sPheLeuSer Thr GlyMetVa I PheGI uAsn LeuAl aLysThr Val Leu Se r As 
756 (EtGctG^TtcG^ 

252> n LeuLeuAspGI yAsn LeuGI nGI yMet LeuAsnl I e Se r Gl nHi sGI nCysVa 

810 (^G&AGc*Ghx7iccGcA(^ 

210> I Ly s Ly s Gl nCysP*oGl nAsn Se r Gl yCysPheArgHI s LeuAspGI uArgGI 
864 GsAGlCTAA&CTCh<E 

288* uGl uCysLy sCysLeu LeuAsnTyrLy sGI nGluGI yAspLy s CysVa I Gl uAsn 

9 19 ccGjvtcctacttgtaacc^G^ 

307> P r oAsnP roThr CysAsnGI uAs nAsnGI yGI yCysAspAl aAspAl aLy s CysThr G 

977 aGgaGgattcaggGagcaacggGaagaaGatcacGtctcaGtct CaaQxtgatt 

326* I uGl uAspSerGI ySerAsnGI yLysLys I leThr CysGI uCysThr LysProAspS 

1034 CTTATCCACTQlTXXATGGTATChTCTG r IXGT 
345> er TyrProLeuPheAspGl y I I ePheCysSer 
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1 GCAGTAACTCCTTOCGTAATT . AACATACTTTCTAAAATTGAAAATGAATP ' 
l^AlaValThrProSerVal lie oAsn I I eLeuSer Ly s I I eGl uAsnGI uTy . ,j 

E coNi j 73) 1 " 

56 AGGTTTTATATTTAAAACCTTTAGCAGGT^ 

19> luValLeuTyrLeuLysProLeuAl aGI yVal TyrArgSerLeuLysLysGl nLe 
111 AGAAAATAACGTTATGACATITAATGTTAATGTTAA 

37>uGI uAsnAsnValMstThrPheAsnValAsnValLysAsplloLeuAsnSorArg 
166 TTTAATAAAOSTGAAAATTTCAAAAATGTrra 

56> PheAsnLys ArgGI uAsnPheLysAsnVal LeuGI uSerAspLeul I eProTyrL 
221 AAGATTTAACATCAAGTAATTATGTTC^ 

74>ysAspLeuThr Ser SerAsnTyrVal Val LysAspProTyrLysPheLeuAsnLy 
276 AGAAAAAAGAGATAAATTCTOAAGCAGTTATAATTATATTAAGGA 

92^sGluLysArgAspLysPheLeuSerSerTyrAsnTyrl I eLys AspSer I leAsp 
331 ACGGATATAAATTTIGCAAATGATCTTCT^ 

111* Thr Asp 1 1 eAsnPheAl aAsnAspVa I LeuGI yTy rTy rLy e 1 1 eLeuSer Gl uL 
386 AATATAAATCAGATTTAGATTCAATTAAAAAAT^^ 

129> y sTy rLy sSerAspLeuAspSer 1 1 eLy sLysTyrl I eAsnAspLysGl nGI yGI 
441 AAATGAGAAATACCTlXXXOTITrAAAC^ 

147^ uAsnGI uLy sTyrLeuProPheLeuAsnAsnl I eGl uThr LeuTy rLysThr Val 
496 AATGATAAAAITCATITATTTGTAATOCATTT^ 

166>AsnAspLys 1 1 eAspLeuPheVal 1 1 eHI sLeuGI uAI aLysVal LeuAsnTyrT 
551 CATATGAGAAATCAAACXnAGAACOT 

184> hr TyrGl uLysSer AsnVal Gl uVal Lys 1 1 eLysGI uLeuAsnTyrLeuLysTh 

606 AATTCAAGACAAATIGGCAGATTTTAAAAAAAATAACAAT^ 

202> r 1 1 eGl nAspLy sLeuAl aAspPheLy sLysAsnAsnAsnPheVa I Gl y 1 1 eAI a 

661 GATTTATCAACAGATTATAACCATAATAACTTATTGACAAAGTT 

221> AspLeu Ser Th rAspTyrAsnHi sAsnAsnLeuLeuThr Ly sPheLeuSer Thr G 

716 GTATCGTTTTTGAAAYICTTGCTAAAAC 

239^ I yMel Val PheG! uAsnLeuAl aLysThr Val LeuSer AsnLeuLeuAspGt yAs 
771 CTTGCAAGGTATGTTAAACATTTCACAACACC^ 

257> nLeuGI nGI yMe ( LeuAsn I I eSer Gl nHi sGI nCysVal Ly sLy sGI nCysP ro 

826 CAAAATTCTGGATGTITCAGACATTTAGATGAAAGAGAAGAATC 

276> Gl nAsnSer Gl yCysPheArgHi sLeuAspGI uArgGI uGl uCysLy sCysLeuL 

881 TAAATTACAAACAAGAAGGTGATAAATGTGTTGAAAATCCA 

294> euAsnTyrLy sGI nGI uGl yAspLysCysVal Gl uAsnProAsnProThr CysAs 

936 OSAAAATAATCGTGGATGTGATGCAGATGC^ 

312> nGI uAsnAsnGI yGI yCysAspAl aAspAl aLy sCysThr Gl uGl uAspSer Gl y 

991 AGCAACGGAAAGAAAATCACATCTGAATGTACTAAACCT 

331> SerAsnGI yLy sLys I I eThr CysGI uCysThr Ly sProAspSerTyrProLeuP 

Pstl (1059) 

1046 TCGATGGTATTTTCTGCAGTCACCACCACCA 
349> heAspGI y I I ePheCysSer Hi sHi sHi sHi sHi sHi s- • • 
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Oligos used: 
OTt: 

TCG ACG AGA GCC ATG AAG GTC CTC ATC CTT GCC TGT CTG GTG GCT 
CTG GCC ATT GCA AGA GAG CAG GAA GAA CTC AAT GTA GTC GGT A, 



OT2: 

GAT CTA CCG ACT ACA TTG AGT TCT TCC TGC TCT CTT GCA ATG GCC 
AGA GCC ACC AGA CAG GCA AGG ATG AGG ACC TTC ATG GCT CTC G, 

MSP1: 

AATAGATCTGCAGTAACTCCTTCCGTAATTG, 
MSP2: 

AATTCTCGAGTTAGTGGTGGTGGTGGTGGTGACTGCAGAAATACCATC 
MSP8: 

TAACTCGAGCGAACCATGAAGGTCCTCATCCTTGCCTGTCTGGTGGCTCTGG 
CCATTGCA 
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Diagram of BC620 



FIGURE 8 
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Pvul (B928) 



FIGURE 9 
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Western Analysis of MSP transgenic milk. 

Lane 1, Molecular weight marker; lane 2, nontransgenic mice milk; lane 3, milk from 
BC628-146 transgenic mouse; lane 4-9, milk from BC670 transgenic mice. The blot was 
reacted with monoclonal antibody 5.2 against MSP. 



FIGURE 10 
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36 ATGAAGCTCCTCATAATTQCCTGTC • JlUJ ClCTOXXATTKaOCCO^^ 

1>MKVLIIACLVALAIAA V T P 8V I O N 
98 JMC C T G IC CW ATOAG^ 

2S> 1 LSKI ENEYEVLYLKPLAGVYRSL 

48> KKOLENNVMTFNVNVKOI LNSRFN 
241 cW*OOS»GMCITCAiyB^ ^ 

72> KR6NFKNVLESDL I P Y K D L T S8NY 

EcoNI (337) 

313 arrOOriCAAJlCMCCCCAC^^ 



96> VVKOPYKFtNKEKROKFLS S Y N Y 1 

120* KDSI 0 T 0 I NFANOVLGYYKI L S E K 
457 OTACAAGNSCCWCTCX^^ 

144> YKSOLOSIKKYINOKO O E N E K Y L P 

529 cntxa\w awcAraatf^^ 

168* FLN.NI ETLYKTVNOKI OLFVI HLE 

Ndo1 < 821 > .^^.^ 
601 GOCCAAGOTOCrcCAOTACACM^ 



192> AKVLQYTYEKSNVEVKI KELNYLK 
673 GACX»!rceaCXATAAOCX«^^ 

216> TIQOKLAOFKKNNNFVOI A P L S T 0 
745 TrACAACCJU^CAACtlTOCT ^ ^ 

240> YNHNNLLTKFLSTGMVF ENLAKTV 

Oeml (849) 

817 CCTGAGCAACCTGCTQGATO QAAAOCTGCAGQ OAATGCTGC MlJVTCtfUOCC*QCAOCAGTC7rC3TGAAG*W3C 

264> LSNLLDGNLQGMLOI SQHQCVKK 
888 K5TOTOOOCAGAACAOCO GATOCTTCAGRCACCTCXATGAGXaCGAGClAGTOCAAGT Ova. AiAACTA 

2B8^Q C PQNSGCFRHLDER EECKCLLNY 
958 CAAGCAtt^aaWMWSCTOKT^^ 

311> KQEGDKC VENPNPTCNENNGGCD 
1029 CCCATCXX^AOIX^ACCaftXX^CX^ GAAGCAACQ OAA/uSAAGATCACCT OCGAGTGTACCAAGCCT 

335>A OAKCTEEOSGSNGKKI TCECTKP 





3S9> OSYPLFOGI FCSHHHHHH- LED 



Xhol (U57) 




FIGURE 11 
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Pvul (8928) 



FIGURE 12 
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Western Analysis of MSP transgenic milk. 

Lane 1, Molecular weight marker, lane 2, nontiansgenic mice milk; lane 3, milk from 
BC628-146 transgenic mouse; lane 4-9, milk from BC670 transgenic mice. The blot was 
reacted with monoclonal antibody 5.2 against MSP. 

(57) Abstract 

The invention provides modified recombinant nucleic acid sequences (preferably DNA) and methods for increasing the mRNA levels 
and protein expression of malarial surface protein MSP-1 which is known to be difficult to express in cell culture systems, mammalian cell 
culture systems, or in transgenic animals. The preferred protein candidates for expression using the recombinant techniques of the invention 
are MSP-1 proteins expressed from DNA coding sequences comprising reduced overall AT content or AT rich regions and/or mRNA 
instability motifs and/or rare codons relative to the native MSP- 1 gene. 
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